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Abstract 



The study investigated inservice teachers’ assessment competency as a function of 
measurement training and years of teaching. Data were collected from 3 1 1 teachers on a 67- 
items Assessment Practices Inventory (API). The reliability of the API was supported by a 
high Cronbach alpha of .97. Construct validity of the API was examined using Rasch model 
procedure. Seven composite scoires were formed based on the underlying dimensions from a 
principal factor analy<?is. A 2x3 MANOVA was conducted to examine the effects of 
measurement training and years of teaching on teachers’ percdved competency in the seven 
assessment categories as reflected in the composite scores. Multivariate interaction effects 
between measurement training and years of teaching were significant (p <.0S). Subsequent 
examination revealed significant multivariate simple effects of measurement training at four or 
more years of teaching in two factor analyzed assessment categories (p < 0 1). Follow up 
comparisons between means indicated that among teachers who had taught four or more 
years, those with measurement training scored significantly higher than those without 
measurement training on standardized test results interpretation/classroom statistics and using 
assessment results in decision making (p <.001). Among teachers who had taught four or 
more years, those with measurement training scored significantly higher than those without 
measurement training on performance assessment and informal observation (p <.05). 
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Introduction 



For decades educational outcomes were measured and controlled through standardized 

paper pencil tests. The focus of assessment research was on large-scale testing in which 
measurement spedalists played a nuyor role. Paradoxically, large-scale testing constitutes 
only 1% of all the assessment events; whereas the other 99% of assessments are conducted 
by classroom teachers (Stiggins, 1991c). The situation was challenged by educational reform 
as we moved into the 1990s. As part of the educational reform movement cognitive 
psycholo^sts and educators began to emphasize the importance of problem solving skills and 
higher order thinking skills which are not measured by simple recall items in paper-pencil 
tests, hi addition, the need to measure students’ ability to apply knowledge in real life 
situations, to cooperate with others, and to do hands-on activities gave rise to a new 
emphasis on alternative assessment methods. These changes also called for the shift of 
importance to classroom assessment. 

The changes in the focus of assessment in the 1990s support the educational reform 
movement of teaching for learning. Central to these clianges is the growing role of classroom 
assessment. To keep up with the new changes, classroom assessment techniques ought to 
receive more emphasis in preservice and inservice measurement traiiung (Stiggins, 1991a, 
1991b). There needs to be a better understanding of the nature and characteristics of 
classroom assessment (Stig^ns, 1992a). Teachers' assessment competency should be 
ei'aluated in relation to measurement training and teaching experience (Zhang, 1995). 

Theoretical Framework of the Study 

Research indicates that teachers spend up to 50% of their classroom time in 
assessment related activities (Sdggins, 1991a). This points to the importance of the need for 
a high level competency of assessment skills for classroom teachers. A few large-scale 
studies have been conducted in recent years to investigate teacher competence of educational 
assessment. These studies have invariably come to the conclusion that teachers' preparation 
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for testing and measurement is inadequate (Jett & Schafer, 1992; Newman & Stallings, 1982; 
Marso & Pigge, 1989; Plake, 1993; Plake, Impara, & Pager, 1993; Wise, Lukin, & Roos, 
1991). The solutions proposed in these studies included strengthening measurement training 
for preservice and inservice teachers and requiring an assessment component in teacher 
certification. 

What assessment skills should be included in a testing and measurement course to 
ensure that preservice and inservice teachers are adequately trained? The answer to this 
question will delineate the content domain of classroom assessment skills. Drawn from 
measurement textbooks and published literature, Schafer (1991) specified eight content areas 
in which teachers need to develop assessment skills; 

1 . Basic concepts and terminology of assessment. Teachers should be well 
acquainted with assessment concepts and terms such as objective tests, alternative 
assessments, formatiye versus summative assessment, criterion-referenced versus norr 
referenced testing and grading, validity, and reliability (Airasian, 1994; Carey, 1994; Shafer, 
1991). 

2. Uses of assessment. Assessment can be used for planning and evaluating 
instAiction, diagnosing problem areas, monitoring progress, grouping students, assigning 
grades, developing curriculum, and evaluating school improvement (Burry-Stock, 1995; 
Stiggins, 1987). 

3. Assessment planning and development. To plan assessment, the teacher needs to 
consider students’ ability level, specify assessment targets, and choose approriate assessment 
methods (Airasian, 1994; Carey, 1994; Stiggins, 1992b). Writing objectives and designing 
test specifications are two important devices that can be used at assessment planning stage to 
ensure validity and reliability of assessment (Burry-Stock, 1995; Zhang & Iran-Nejad, 1993). 
Three major assessment methods are paper-pencil tests, performance measures, and informal 
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assessment (Stiggins, 1992b). The first two methods are considered formal and the last one 
informal (Airasian, 1994). 

4. Interpretation of assessments. To interpret properly standardized test results, 
teachers need to have a good understanding of the meaning and limitations of standard scores 
such as percentile rank, grade equivalent score, normal curve equivalent, and stanine. They 
should understand the concept of standard error of measurement and be able to use that 
concept in explaining a percentile band-of-error. In addition, teachers should know how to 
judge the adequacy of test norms and use norms to evaluate individuals’ performance 
(Airasian, 1994; Carey, 1994; Shafer, 1991). 

5. Description of assessment results. This involves the application of basic statistical 
analysis to classroom assessment data. 

6. Evaluation and improvement of assessments. The teacher should be able to use 
assessment data to analyze the quality of a test (validity and reliability) and test items (item 
analysis) (Carey, 1994; Gregory, 1992). 

7. Feedback and grading. When assigning grades, teachers need to make three 
decisions; the grading model to be used (criterion-referenced versus norm-referenced), 
performance components to be included in grades (achievement-related versus non- 
achievement-related), and the weight each component should receive (Airasian, 1994). 
Criterion-referenced grades indicate mastery level of teaching objectives whereas norm- 
referenced grades suggest an individual’s relative standing in a class. To produce valid 
grades, teachers should only include achievement-related components in the calculation of 
final grades. Assessment components that reflect effort, ability, attitude, and motivation 
should not be included in subject matter grades because they are hard to define and measure 
(Stiggins, Frisbie, & Griswold, 1989). Asses-sment components that provide more 
information about students’ academic achievement and learning (e.g., final exam) should 
receive more weights (Airasian, i994; Carey, 1994). The methods that can be used to 
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integrate different assessment components to determine the final grades include percentage 
method, weighted method, and maximum point method (Carey, 1994). When called upon, 
teachers should explain to students, parents, and other educational personnel about the 
meaning, implication, and limitation of their grading system used to determine grades. 

8. Ethics of assessment. Teachers should guard against overuse and misuse of 
assessment results. Students’ right to fair testing and confidentiality should be protected 
(Plaice, 1993; Schafer, 1991). 

These assessment skills summarized the expectations of the assessment community 
for classroom teachers. In 1990, the American Federation of Teachers (AFT), National 
Council on Measurement in Education (NCME), and National Education Association (NEA) 
issued the seven Standards for Teacher Competence in Educational Assessment of Students 
(Appendix A), ne seven Standards incorporated the various assessment skills addressed in 
measurement textbooks and assessment literature. According to the seven Standards, 
teachers should be skilled in 

1 . choosing assessment methods, 

2. developing assessment methods, 

3. administering, scoring, and interpreting assessment results, 

4. using assessment results in decision making, 

5. grading, 

6. communicating assessment results, and 

7. ethics in assessment. 

The se^'en standards were intended to guide teacher evaluation and measurement 
training. The assessment skills addressed in the literature and the seven Standards formed the 
theoretical foundation for the present study. 
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Problems in Classroom Assessments 



Research suggests that teachers are not well prepared to meet classroom assessment 
demands (Stiggins, 1991a) due to inadequate training in measurement (Goslin, 1967; Hills, 
1991; Jett & Schafer, 1992; O’Sullivan & Chalnick, 1991; Roeder, 1972; Wolmut, 1988), 
Problems can be found in various aspects of classroom assessment. 

Stiggins (1992a) conducted qualitative research with a group of teachers teaching 
math, science, speaking, and writing in second, fifth, eighth, and eleventh grades. Seventy- 
eight percent of the teachers reported using performance assessment in their classroom. 
However, one third of these teachers did not define levels of performance or plan scoring 
procedures in advance, nor did they i.Jbrm students of performance evaluation criteria. 

About 50% of the teachers did not record their scoring during assessment. These practices 
were not in line with recommended performance assessment methods. 

From their survey with 130 preservice teachers and 1 19 inservice teachers, 
Vanleirsburg and Johns (1991) concluded that approximately 40% of the teachers studied did 
not know that standardized tests and teacher-made tests should differ in terms of 
administration. Twenty percent of the inservice teachers and 26% of the preservice teachers 
disagreed that it was vital to read directly from the test manual when administering a 
standardized test. Hall and Kleine (1992) defined nonstandard testing practices as teaching 
test items, increasing time limits, giving hints, and changing students’ answers. Fifty-five 
percent of the teachers in their study reported engaging in these inappropriate practices. The 
pressure for teachers to raise standardized test scores through means other than instructional 
improvement was also widely reported by the teachers in another study by Nolen, Haladyna 
and Haas (1992). In addition, most teachers had trouble understanding and interpreting 
standard scores such as percentile rank and grade equivalent score (Hills, 1991). Impara, 
Divine, Bruce, Liverman, and Gay (1991) reported, based on their experimental study, that 
even with the help of interpretive information, most teachers were weak in interpreting a 
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percentile band performance profile. Marso and Pigge (1988) concluded from their 
investigation that teachers were not well trained to use f ^andardized tests in their classroom. 

Problems with grading are just as abundant. Wiggins (1988) pointed out that grading 
criteria used by teachers were arbitrary and mysteries, often a function of teacher taste rather 
than a representation of inherent and tangible standards. Based on the results of a case study 
of IS high school teachers’ grading practices, Stiggins, Frisbie, and Griswold (1989) 
discovered that 80% of the teachers felt that effort, attitude, and motivation should be 
considered in grading, 50% of the teachers studied incorporated ability into grades, S0% of 
the teachers studied did not distinguish between formative and summative data and thus did 
not use weights to reflect the relative importance of assessment components, they ga'.'e little 
attention to assessment errors, and all teachers used subjective nonachievement factors to 
make decisions about borderline cases. 

The common practice of incorporating effort and attitude into grades was also 
criticized in other studies (Hills, 1991; Jongsma, 1991). Griswold (1993) studied 326 
teachers’ grading decisions for two borderline cases. He discovered that the vast majority of 
the participants made effort-based judgments about grades. Griswold suggested that teachers 
incorporated social-cognitive learning and attribution theories into their grading belief system 
and practice when they graded on nonachievement factors such as efforts and attitude. It was 
not clear, however, how teachers operationally define and assess aptitude and other intrinsic 
dispositions such as effort and attitude. 

In sum, previous assessment research has been conducted along two lines; the 
standards for teacher assessment competency and problems in classroom assessment. Since 
classroom assessment is a broad area, most studies concentrated on one aspect of classroom 
assessment. Few researchers have examined teachers’ assessment competency against the 
seven Standards. The present study differs from most other studies in that it is based on the 
theoretical framework of assessment literature and the seven Standards by the AFT, NCME, 
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and NEA (1990). The purpose of the study is to investigate the effects of measurement 
training and teaching experience on teachers’ perceived assessment competencies in the entire 
arena of classroom assessment. In particular, the researchers are interested in finding out if 
inservice teachers would respond differently to the API due to difference in measurement 
training and years of training. 

Methodology 

Sample 

The sampls was made up of 3 1 1 inservice teachers. Two hundred and sixty inservice 
teachers came from two local school districts in Alabama: Tuscaloosa County School and 
Tuscaloosa City School. The numbers of elementary, middle school/junior high, and high 
schools participating in the study were 6, 4 , and 6, respectively. A vocational school was 
also included in the data collection. This was done to ensure a balanced representation of 
teachers from different grade levels. Another S 1 inservice subjects were students enrolled in 
the graduate courses of Assessment of Classroom Learning (HER 550) and Research 
Methods in Education (BER 500) at the University of Alabama, and Elementary Science 
Teaching Methods (CEE 5 1 5) at the University of Alabama-Gasden Center during the spring 
1995 semester. The inservice subjects were also primarily white (89%) and female (77.4%). 
The percentages of elementary, junior high/middle school, and high school teachers were 34, 
23, and 30, respectively. The remaining 13% were for comprehensive and other types of 
schools. Forty percent of these teachers obtained a bachelor’s degree, another 56% had a 
Master’s degree. About 28% of the inservice teachers had had one measurement course, 

45% of the inservice subjects had taken 2-3 measurement courses. The average number of 
years of teaching was 10.9. 

Data from inservice teachers were collected in March, 1995. The instrument, 
together with a cover letter and computer scanable answer sheet, was distributed to the 
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teachers by their school principal at a faculty meeting. Since directions were clearly provided 
in written form, participants responded to the instrument on their own. Those who 
voluntarily responded to the instrument returned the completed answer sheets to the school 
secretary. The return rate was approximately 30%. 

Instrumentation 

The instrument used in the study was the Assessment Practices Inventory (API) 
designed by Zhang and Burry-Stock ( 1 994). After three pilot studies and numerous 
revisions, the current version of the API has 67 items each of which describes a classroom 
assessment practice. The API was developed according to the seven Standards for Teacher 
Competence in Educational Assessment of Students (AFT, NCME, & NEA, 1990). The 
statistical analysis of the data from 3 1 1 inservice teachers yielded a Cronbach alpha 
coefficient of .97, indicating the instrument had a high measure of reliability. All item-to- 
total correlations were above .37 wiih the highest one being .69. The standard error of 
measurement was 7.7. 

The content validity of the instrument was built into the construction process by 
developing the items according to the seven Standards for Teacher Competence in 
Educational Assessment of Students. The construct validity of the API was examined using 
Rasch model (Rasch, 1960) computer program BIGSTEPS (Linacre & Wright, 1994). The 
item calibration was estimated for each of the 67 items and then located along a continuum. 
The distribution of item logits from -.89 to 1 .3 1 (Appendix B) indicated that the items, to a 
degree, defined the theoretical construct “perceived skill level” of classroom assessment for 
inservice teachers (Wright & Stone, 1979). 

Data Analysis 

Principal Factor Analysis . To identify the underlying dimensions of the API, a 
principal factor analysis was conducted with principal axis method of extraction (squared 
multiple correlations were used on the diagonals of the correlation matrix) and a vaiimax 
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orthogonal rotation. Based on eigenvalues greater than one criterion (Kaiser, 1960), seven 
factors were retained. The seven factors accounted for 5 1 .91% of the variance. The seven 
factors and their percent of variance were: 

1 . Develop and administer paper-pencil tests, choose/revise tests for classroom 
use (12.24), 

2. Interpret .standardized test results/classroom statistics, use assessment results in 
decision making (10. 1), 

3. Develop and use performance assessment, informal assessment (S.3 1), 

4. Communicate test results (6.84), 

5 . Non- Achievement based grading (5.91), 

6. Ethics in assessment (4.43), and 

7. Grading (4.07). 

Appendix C shows rank ordered factor loadings of individual items for a seven factor 
solution. For each factor, only high loadings (greater than .31) selected by the computer 
program are presented. The final communality estimates for each item, the sum of squared 
factor loadings for each factor, and the percent of variance explained by each factor are also 
presented. 

2x3 MANOVA. Based on the factor analysis, the 67 items were classified into seven 
assessment categories each of which was represented by a factor. A composite score was 
calculated for each person by summing up all the numbers of the scaled responses to the 
items loading high on that factor. The seven composite scores served as the dependent 
variables. The two independent variables were measurement training (no training, at least 
one measurement course) and years of teaching (one or less year of teaching, two-three years 
of teaching, four or more years of teaching). A 2x3 MANOVA was then conducted on the 



Using GLM in S AS, the results of the 2x3 MANO VA were checked in the following 
way. First, multivariate interaction effects were checked; if multivariate interaction effects 
were significant, multivariate simple effects were checked; if multivariate simple effects were 
significant, univariate simple effects were checked (Littell, Freund, & ctor, 1991). 

Results 

Table 1 shows significant overall cel! effects (F= 1.61, p=.0146), suggesting there 
were significant multivariate interaction effects between measurement training and years of 
teaching. Subsequent examinations of the multivariate simple effects indicated significant 
multivariate simple effects of training at four or more years of teaching (F=3.3, p=.0022). 
The multivariate simple effects are presented in Table 2. 



Table 1 



A ivLrtjNuv/\ on me uaia. uveraj 

Assessmciit Practices in Seven Catceories 


11 i^eii cuecis 01 Measuremem i raining dv rears or leacmng on 
N=311 


Value 


F Value p Value 


Wilks’ Lambda .82 


1,61 .0146* 



♦ significant at alpha=.05 



Table 2 



Years of Tcachine on Assessment Practices in Seven CatCEorics 


N=31I 




Value 


F Value 


E Value 


Wilks* Lambda .92 


3.3 


.0022** 



** significant at alpha=.01 
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Examinations of the univariate simple effects of measurement training at four or more 
years of teaching in the seven assessment categories revealed significant univariate simple 
effects in category two (interpret standardized test results/classroom statistics, use 
assessment results in decision making) (F= 12.74, u=.0004) and category three (develop and 
use performance assessment, informal assessment) (F=4.26, p=.0399). The results are 
presented in Table 3. 



Tables 

A 2x3 MANQVA of the API Data: Univariate Simple Effects of Measurement Training at Four or More 
Years of Teaching on Assessment Practices in Seven Categories N=31I 



Assessment Categorj' 


F Value 


P Value 


1. Develop and administer papcr-pcncii tests, 
cboose/revisc tests for classroom u. 


1.52 


.2187 


2, Interpret standardized lest rcsulis/classroom statistics, 
use assessment results in decision milking. 


12.74 


.0004** 


3. Develop and use performance assessment, 
informal assessment. 


4.26 


.0399 * 


4. Communicate test results. 


2.76 


.0977 


5. Non-Achievement based grading. 


.19 


.6672 


6. Ethics in assessment. 


2.58 


.1091 


7. Grading. 


.22 


.6391 



* significant at alp>ia=.05 



♦♦ significant at alpha=.01 



Follow up comparison between means indicated that among teachers who had taught 
four or more years, those with measurement training scored significantly higher than those 
without measurement training in category two (46.93 versus 39.46, F=12.74, p=.0004). 
Among teachers who had taught four or more years, those with measurement training scored 
significantly higher than those without measurement training in category three (40.86 versus 
37.90, F=4.26, p=.0399). 

Discussion and Conclusion 

The analysis of the data suggested that among teachers who had taught for four or 
more years, those who had received measurement training were more skilled in interpreting 
standardized test results, conducting classroom statistics, and using assessment results in 
decision making (assessment category II) than those who had not received any measurement 
training. It was also suggested that among teachers who had taught for four or more years, 
those who had received measurement training were more skilled in using performance 
assessment and informal observation (assessment category 111) than those who had not 
received any measurement training. 

Zhang’s research using Rasch model analysis (1995) indicated that interpreting 
standardized test results, conducting classroom assessment, using assessment results in 
decision making were perceived by inservice teachers to be the hardest assessment category. 
According to Impara at el. (1991), teachers had trouble interpreting a percentile band 
performance profile even with the help of interpretive information. The present research 
findings suggested that it was in this assessment category that measurement training and 
teaching experience worked together most effectively to enhance teachers' assessment 
competency. 

Zhang’s study (1995) also suggested that performance assessment and informal 
observation were considered by inservice teachers to be harder than paper-pencil tests. 
Teachers in general were less proficient in performance assessment and they often failed to 
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follow the recommended practices in instrument construction, assessment recording, and 
assessment criteria communication (Stiggins, 1992a). Yet, it is in this category that teachers 
with measurement training and teaching experience (four or more years) were more skilled 
than those without measurement training and with less teaching experience. These two 
findings testified the value of measurement training and teaching experience. 

In conclusion, the research findings of the present study provided evidence 
concerning the effects of measurement training and teaching experience in two assessment 
categories that were normally considered to be difficult by inservice teachers. One possible 
explanation of the interaction effects is that measurement training provides the principles and 
techniques needed for effective classroom assessment and teaching/testing in the classroom 
gives the teacher an opportunity to practice and use the book knowledge in a hands-on 
approach. The length of time suggested here (four or more years of teaching) is consistent 
with the findings from expert teaching literature that teachers develop their expertise through 
a series of stages as they move fi'om novice to experts (Berliner, 1987). Future study should 
focus on exploring the assessment areas in which experienced and novice teachers differ and 
use that information to guide preservice and inservice measurement training. 

Since the self-report inventory was used only with 3 1 1 inservice teachers mainly &om 
two local school districts, the present research findings should be interpreted with caution. 
The replication of the study with a larger sample is desired to confirm the present research 
findings. 
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Appendix A 

The Seven Standards for Teacher Competence of Educational Assessment of Students 



Teacbeis sbouldtw skilled in choosing a ss es s m ent mctbods appxopitate for 
Instiactloiial dedsloas. 



Skills in choosing appropriate, useful, adminisiralively convenient, fechnicaUy adequate, and fair assessrnent methods are 
prerequisite to good use of information to support instructional decisions. Teacfttrs need to be well-acquainted with the 
kinds of infonnation provided by a broad range of assessment alematives and their strengths and weaknesses. In 
part/cular. they should be familiar with cnltria tor evakiating and selecting assessment methods in light of instructional 
plans. 

Teachers who meet mis standard will have me conceptual and appication skills that follow. They win be able to use the 
cofx»pts of assessment error and vaidty when developing or selecting their approaches to cfassroom assessment of 
students. They will understand how valid assessment data can support insmictionaJ activities such as providing 
appropriate feedback to students, dagnosing group and individual learning needs, ptenninQ for indviduafired educstional 
programs, motivating students, and evaluating instniciional procedures. They will understand how invald irrionrftation can 
affect instructional decisions about students. They will also be able to use and evaluate asscssmerri options available to 
them, considering among other mings. the cultural, sodaJ, economic, and language backgrounds of students. They will 
be aware that dffereni assessment approaches can be incompatible with certain instiuctionai goals and may impact quite 
differently on meir teaching. 

Teachers vwl! know, for each assessment approach they use, its appropriateness (or making decisions about their pupils. 
Moreover, teachers vwll know of where to find information about and/or reviews of various assessment methods. 
Assessment options are dverse and include text- and cuniculum-tmbedded questior» and tests, standardized criterion- 
referenced and norm-referenced tests, oral questioning, spontaneous and st/ucfured performance assessments, 
portfolios, exhibitions, demonstrations, rating scales, writing samples, paper-and-pendl tests, seatwork and homework, 
peer- and self-assessments, student records, observations, queslionnaifts. interviews, projects, products, and others’ 
opinions. 




Teachers should be skilled In developing assessment methods approptiate for 
instiuctionai dedsloiis. 



While teachers often use pubfished or other external assessment tools, the bulk of the assessment information they use 
for dedsion-making comes from approaches they create and implement. Indeed, the assessmertt demands of the 
classroom go well beyord readily available instrumerus. 

Teachers who meet this sundard will have the conceptual and appficaiion skills that follow. Teachers will be skilled in 
planning the collection of information that fadStaies the decisions they vrill make. They will know and follow appropriaie 
prindptes for developing and using assessment methods in their teaching, avoiding common put alls m student 
assessment. Such techniques may induce several ot the options fisted at the end oi the first standard. The teacher will 
select the techrtiques which are appropnaic to the intent of the teachers instruction. 

Teachers meeting this standard will also be skilled in using student data to arwiyze the quality of each assessment 
technique they use. Since most teachers do not have access to assessment speoausts. they must De prepared to do 
these analyses themseNes. 
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The temcher should be skilled In rndmlulstexiiig. scorlnf iziterpretlja^ the 
results of both extcnuDy-produced and tescher-produced assessment methods. 



It is nor enough that teadiers are able to select and develop good assessment methods; they must also be able to apply 
them property. Teachers shoukl be skilled in administering, scoring, and interpreting results from diverse assessment 
methods. 

Teachers who meet this standard will have the conceptual and application skills mat follow. They will be skilled in 
interpreting informal and formal teach er*produced assessment results, including pupils* performances in class and on 
homework assignments. Teachers will be able to use guides tor sconng essay questions and projects, stendis tor scoring 
response-choice questions, and scales for rating performance assessments. They wiB be able to use these in ways mat 
produce consistent results. 

Teachers will be able to administer standardized achievement tests and be able to interpret the commonly reponed 
scores: percentile ranks, percentile band scores, standard scores, and grade equivalents. They will have a conceptual 
understanding of the summary indexes commonly reported with assessment results: measures of central terxfency. 
dispersion, retatior^hips, reliability, arxl errors of measurement. 

Teachers will be able to apply these concepts of score and summary indices in ways that enhance their use of me 
assessments that they develop. They will be able to analyze assessment results to identity pupils' strengths and errors, if 
they get inconsisterrt results, they will seek other explanations for me discrepancy or other data to attempt to resolve the 
uncertainty before arriving at a decision. They will be able to use assessment methods in ways that encourage students 
educatlonaJ development and that do not inappropriately increase students' anxiety levels. 




Teachers should be skilled In usin^ assessment results when makts^ dedsiocs 
about individual students, planning teaching, developing cuniculum, and 
school ImpTovcment, 



Assessrnent results are used to make educational decisions at several levels: in the classroom about students, in the 
community about a school and a school district, and In society, generally, about me purposes and outcomes of the 
educational enterprise. Teachrrs play a vital role when participating in dedstornnaking at each of these levels and must be 
able to use assessment resufts effectively. 

Teachers who meet this standard will have the conceptual and application skills that follow. They will be able to use 
accumulated assessment information to organize a sound instnjctional plan for facilitating students' educational 
development. When using ^sessment results to plan and/or evaluate instruction and cum*cukjm. teachers will interpret 
me results correctly and avoid common misinterpretations, such as basing decisions on scores that lack curriculum validHy. 
They will be Informed about the results of local, regional, state, arxf national assessments and about their appropriate use 
for pup), classroom, school, district, state, arxf national educational improvement. 




Xesichcrs should be skilled lu developing valid pupil gradixig procedures which 
lise pupn assessments. 



Grading studenis is an important paa of proiessionai practice for teachers. Grading is defined as indicating both a 
student's level of periormarKte and a teacher's valuing of mat performance. The principles lor using assessments to obtain 
vaBd grades are known and teachers should employ them. 

Teachers who meet this standard will have the conceptual and application skills that follow. They will be able to devise, 
implemeni* and explain a procedure tor developing grades composed of marks from various assignments, proiecis. in* 
class aaivilies. quizzes, tests, and/or other assessments that they may use. Teachers will urxlcrstand and be able to 
articulate why the grades they assign are rational, justified, and fair, acknowledging mat such grades reflect their 
preferences and judgments. Teachers will be able lo recognize and to avoid faulty gradir^g procedures such as using 
grades as punishment, ijiey will be able to evaluate and lo modily their grading procedures in order to improve the validity 
of the interpretations made from them about students' attainments. 
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Teachers should be sldDed In comnnmlcatiiig assessment results to students, 
parents* other lay audiences, and other educators^ 



M 



Teachers must routinely report assessment results to students and to parents or guardians. In addition, they are 
frequently asked to rep(^ or to discuss assessment results with other educators and with diversa lay audiences, it the 
results are not communicated effectively, they may be misused or r»t used. To oomnnunicate effectively with others on 
matters of student assessment* teachers rrust be able to use assessmera terminoiogy appropriately and must be able to 
articulate the meaning. Imitations, and impScations of assessmem results. Furthermofe, teachers will sometimes be in a 
position that will require them to defend their own assessment proMdures and their Inter pretaiions of them. At other 
times, teachers may need to http the pubic to interpret assessment rtsuKs appropriately. 

Teachers who meet this standard will have the conceptual and appication stalls that foliow. Teachers will understand and 
be able to give appropriate explanations of how the interpretation of student assessments must be moderated by tne 
studenrs socio-economic, cultural, language, and other background factors. Teachers will be able to explain that 
assessment results do not imply that such background factors limit a students ultimate educational development. They 
will be abla to comrrxjnicate to students and to their parents or guanfians how they nay assess the student’s educational 
progress. Teachers will understand and be able to explain the importance of taking measurement errors into account 
when using assessments to make ded&ons about individual students. Teachers will be able to explain the Bmiations of 
different informal and formal assessment methods. They will be able to explain printed reports of the results of pupil 
assessments ai me classroom, school district, state, and national levels. 




Teacheis should be skilled in reco^nizisg illegal, and otherwise 

ajiA useS Of ISSeSSlXieXlt ixifoidation. 



Fairness, the rights of ail concerrid. and professional ethical behavior must undergird aii student assessment activities, 
from the initial planning for and gathering of information to the interpretation, use. end communication of the results. 
Teachers must be well-versed in their own ethical and legal responsibilities In assessment. In addition, they should also 
attempt to have the inappropriate assessment practices of others discontinued whenever they are encountered. 
Teachers should also participle with the wider educational community in defining me imrts of appropriate professional 
behavior in assessmem. 

Teachers who meet this standard will have the conceptual and appScatici skilU that follow. They will know those laws and 
case decisions which affect their classroom* school dstrict* and state assessment practices. Teachers will be aware that 
various assessment procedures can be misused or overused resulting in harmful oonsequences such as embarrassing 
students, violating a studerrrs right to confidereiatity. and inappropriately using saidents' standardized achievement test 
scores to measure teaching effectiveness. 
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Appi^ndix B 



Rasch Model Analysis of the API Data: Locale Item Calibrations Along a Continuum N=3 1 1 
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Appendix C 

Principal Factor Analysis of the API Data: A Seven-Factor Solution With a Varimax Rotation N=311 



Final 





Item 


Factor I Factor 11 Factor 111 


Factor rv Factor V Factor VI Factor VII Cominunality 

Eblimates 


1 


14 


.73 


.62 
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12 


.72 


.66 


3 


15 
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.59 
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13 
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Appendix C Continued 





Item 


Factor I Factor II Factor III 


Factor IV 


Factory 


Factor VI 


Final 

Factor VII Commuaaiity 

Estimates 
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Sum of Squared 8.20 
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